Abstract: A Relay Station (RS) uses a buffer to store and
process the received data packets before forwarding them. 
Recently, the buffer has been exploited in one-way relaying to 
opportunistically schedule the two different links according to 
their channel quality. The intuition is that, if the channel to 
the destination is poor, then RS stores more data from the 
source, in order to use it when the channel to the destination is 
good. We apply this intuition to the case of half-duplex two-way 
relaying, where the interactions among the buffers and the links 
become more complex. We investigate the sum-rate maximization
problem in the Time Division Broadcast (TDBC) protocol: the users send
signals to the RS in different time slots, the RS decodes and 
stores messages in the buffers. For downlink transmission, the 
RS re-encodes and sends using the optimal broadcast strategy. 
The operation in each time slot is not determined in advance, 
but depends on the channel state information (CSI). We derive 
the decision function for adaptive link selection with respect 
to CSI using the Karush-Kuhn-Tucker (KKT) conditions. The 
thresholds of the decision function are obtained under Rayleigh 
fading channel conditions. The numerical results show that the
sum-rate of the adaptive link selection protocol with buffering is
significantly larger than that of the reference protocol with a fixed
transmission schedule.

I. Introduction 

Wireless networks with relays are a subject of intense
research interest. Capacity bounds and various cooperative
strategies for relay networks have been studied in [1]. Reference [2]
extended Decode-and-Forward (DF) relaying to multiple access
relay channels and broadcast relay channels, and generalized
Compress-and-Forward (CF) relaying to multiple relays. A
paradigm shift in communicating multiple flows through relays
occurred with the concept of network coding [3], where a relay
transmits functions of the incoming communication flows,
rather than only replicating the incoming flows. This idea has a
particularly promising application in wireless networks, where
it was shown that two-way relaying can be improved when the
relay uses wireless network coding [4], [5].

In an information-theoretic framework, the scheduling of the
different links of a relay system is usually not questioned.
For example, in one-way relaying, the source first sends its
data to the relay station (RS) and the RS forwards it. This
operation is repeated sequentially. In a practical system, the
relay needs a buffer to store the received packets, in order
to process them before forwarding. Such buffering gives an
opportunity for clever scheduling that exploits the channel
state information (CSI), as proposed in [6]. If the link from
the RS to the destination is weak, it might be beneficial not
to forward any data but instead accumulate more data from
the source, and wait for a better channel to forward it.
The advantage of scheduling at the relay that is not fixed
in advance was pointed out in [7], where it is proven that
a protocol with a random schedule is better than a protocol
with a fixed schedule. The works [8] and [9] treat the one-way
relaying scenario and reveal that buffer-aided protocols
determined by instantaneous CSI outperform conventional
relaying protocols with fixed transmission time slots.

Figure 1. TDBC two-way transmission model

In this paper we generalize the concept of buffer-aided
relaying to the case of two-way relaying. The presence of
two communication flows and buffers significantly changes the
optimization problem. We focus on the sum-rate maximization
problem in the Time Division Broadcast (TDBC) two-way
relay network with buffering, as shown in Fig. 1. We use
the optimal broadcast strategy introduced in [10] for two-way
relaying with side information at the terminals. We solve the
optimization problem by relaxing the discrete constraints and
applying the Karush-Kuhn-Tucker (KKT) conditions. The
numerical results show that the sum-rate of the proposed adaptive
link selection protocol with buffering significantly exceeds that of the
reference protocol with a fixed transmission schedule.

II. System Model 

Consider a two-way relay network in which two users (U0
and U2) intend to exchange information with the aid of a
relay station (RS1), as Fig. 1 shows. RS1 applies
Decode-and-Forward (DF) and has two buffers for storing messages, one
for each of the users, assumed to be of unlimited size. All the
nodes are half-duplex, such that a node can either transmit
or receive at a given time. We assume that there is no direct
link between U0 and U2. The elementary transmission unit
is a time slot of fixed duration. We assume a block fading
channel, such that the channel is constant over the duration
of one time slot but changes independently from one slot to
another. $h_{01}(i)$ and $h_{12}(i)$ are the channels of links U0↔RS1
and RS1↔U2 in time slot $i$, respectively. In every time slot
$i$, all the nodes know the instantaneous channels, which are
assumed to be reciprocal. $P$ is the transmission power of
each node. The noise at all receivers, $z_j \sim \mathcal{CN}(0, \sigma^2)$,
$j \in \{0,1,2\}$, is independent Additive White Gaussian Noise
(AWGN) with zero mean and variance $\sigma^2$. The instantaneous
Signal-to-Noise Ratios (SNRs) of links U0↔RS1 and RS1↔U2
are $\gamma_0(i) = P|h_{01}(i)|^2/\sigma^2$ and
$\gamma_2(i) = P|h_{12}(i)|^2/\sigma^2$, respectively. The
expected value of the SNR is $\Omega_l = E\{\gamma_l(i)\}$, $l \in \{0,2\}$.
When user $l$ transmits in slot $i$, the maximal instantaneous rate
is $C_l(i) = \log_2(1 + \gamma_l(i))$. The transmission from RS1 is a
broadcast process with side information at the two terminals (U0
and U2). From [10], the maximal broadcast rates on the two links
are $C_0(i)$ and $C_2(i)$, i.e. the maximal instantaneous achievable
rate of each individual link.
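
As an illustration of the channel model above, the following Python sketch draws per-slot SNRs for the two links and converts them to instantaneous rates. It assumes Rayleigh block fading, so that each SNR is exponentially distributed around its mean; the function name `draw_rates`, the argument names and the seed are hypothetical choices made for this example only.

```python
import math
import random

def draw_rates(omega0, omega2, n_slots, seed=0):
    """Return a list of (C0, C2) rate pairs in bits per channel use.

    omega0, omega2: mean SNRs (linear scale) of links U0-RS1 and RS1-U2.
    Assumption: Rayleigh block fading, i.e. exponentially distributed SNR
    that is redrawn independently in every slot.
    """
    rng = random.Random(seed)
    rates = []
    for _ in range(n_slots):
        g0 = rng.expovariate(1.0 / omega0)  # instantaneous SNR of link 0
        g2 = rng.expovariate(1.0 / omega2)  # instantaneous SNR of link 2
        rates.append((math.log2(1.0 + g0), math.log2(1.0 + g2)))
    return rates
```

For instance, `draw_rates(10.0, 10.0, 5)` returns five rate pairs for two links with a mean SNR of 10 dB each (10 in linear scale).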

III. Transmission protocol design 
A. Operation of the buffers 

RS1 has two buffers $Q_0$ and $Q_2$ to store the decoded
messages from U0 and U2, respectively. In a given time slot
$i$, only one of the nodes 0, 1 and 2 can transmit a
signal. The transmission schedule is determined in each time
slot and depends on the instantaneous CSI and the state of the
buffers. The variable $p_j(i) \in \{0, 1\}$ indicates whether node
$j \in \{0,1,2\}$ transmits in time slot $i$ ($p_j(i) = 1$) or not
($p_j(i) = 0$). Obviously, $p_0(i) + p_1(i) + p_2(i) = 1$ has to
be satisfied. In slot $i$, the buffers are updated as follows.

• If U0 transmits:

$$Q_0(i) = Q_0(i-1) + C_0(i), \quad Q_2(i) = Q_2(i-1). \tag{1}$$

• If U2 transmits:

$$Q_0(i) = Q_0(i-1), \quad Q_2(i) = Q_2(i-1) + C_2(i). \tag{2}$$

• If RS1 transmits:

$$Q_0(i) = Q_0(i-1) - \min\{C_2(i), Q_0(i-1)\} \tag{3a}$$
$$Q_2(i) = Q_2(i-1) - \min\{C_0(i), Q_2(i-1)\}. \tag{3b}$$

The optimal transmission strategy should satisfy:

$$E\{Q_l(i) - Q_l(i-1)\} = 0, \quad l \in \{0, 2\} \tag{4}$$

indicating that the number of bits in each buffer should be
stable for optimal operation.
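
The buffer recursion for the three cases (U0 transmits, U2 transmits, RS1 broadcasts) can be sketched in Python as follows. The function name `update_buffers` and the third return value (the number of bits delivered in the slot) are additions made for illustration and are not taken from the paper.

```python
def update_buffers(q0, q2, c0, c2, node):
    """Apply one slot of the buffer recursion and return (q0, q2, delivered).

    q0, q2: bits stored for U0's and U2's messages before the slot.
    c0, c2: instantaneous rates of links U0-RS1 and RS1-U2 in this slot.
    node:   0 (U0 transmits), 2 (U2 transmits) or 1 (RS1 broadcasts).
    """
    if node == 0:
        return q0 + c0, q2, 0.0
    if node == 2:
        return q0, q2 + c2, 0.0
    if node == 1:
        out0 = min(c2, q0)  # U0's data leaves towards U2 over link RS1-U2
        out2 = min(c0, q2)  # U2's data leaves towards U0 over link RS1-U0
        return q0 - out0, q2 - out2, out0 + out2
    raise ValueError("node must be 0, 1 or 2")
```

Note that in a broadcast slot each buffer is drained at the rate of the link towards the other user, and never below zero.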

Incorporating (1)-(4) and considering the transmission
indicator $p_j(i)$ of each node, we get the criterion for optimal
transmission as follows:

$$E\{p_0(i) C_0(i)\} = E\{p_1(i) \min\{C_2(i), Q_0(i-1)\}\} \tag{5a}$$
$$E\{p_2(i) C_2(i)\} = E\{p_1(i) \min\{C_0(i), Q_2(i-1)\}\}. \tag{5b}$$

Under the stability condition (4) and similar to [8, Theorem 2], the impact of the events
$C_2(i) > Q_0(i-1)$ and $C_0(i) > Q_2(i-1)$ is negligible over
a very long period, and (5) becomes

$$E\{p_0(i) C_0(i)\} = E\{p_1(i) C_2(i)\} \tag{6a}$$
$$E\{p_2(i) C_2(i)\} = E\{p_1(i) C_0(i)\}. \tag{6b}$$



B. KKT conditions for maximizing the system sum-rate

Our goal is to maximize the average sum-rate of the two-way
relaying system under the buffer stability conditions (6).
We consider an observation window of $N$ time slots for which
we define the target function $f(p)$ and the constraint functions
$h_k(p)$, $k = 1, \ldots, N+2$ in (7). $N$ is assumed to
be asymptotically large so that, assuming ergodicity, the time
average is equivalent to the ensemble average:

$$f(p) = \frac{1}{N} \sum_{i=1}^{N} p_1(i) [C_0(i) + C_2(i)] \tag{7a}$$
$$h_1(p) = \frac{1}{N} \sum_{i=1}^{N} [p_0(i) C_0(i) - p_1(i) C_2(i)], \quad h_2(p) = \frac{1}{N} \sum_{i=1}^{N} [p_2(i) C_2(i) - p_1(i) C_0(i)] \tag{7b}$$
$$h_{2+i}(p) = 1 - p_0(i) - p_1(i) - p_2(i), \quad i = 1, \ldots, N \tag{7c}$$

where $p = (p_0(1), p_1(1), p_2(1), \ldots, p_0(N), p_1(N), p_2(N))$.

Our optimization problem consists in maximizing the average
sum-rate over the transmission indicators $p_j(i)$:

$$\begin{aligned}
\max_{p} \quad & f(p) \\
\text{s.t.} \quad & h_k(p) = 0, \quad k = 1, \ldots, N+2, \quad N \to \infty \\
& p_j(i) [1 - p_j(i)] = 0, \quad j \in \{0,1,2\}, \quad i \in \{1, \ldots, N\}.
\end{aligned} \tag{8}$$



Note that $p_j(i)$ is binary, so the optimization problem above
is over a discrete domain. We therefore relax the constraints
by assuming that $p_j(i)$ takes continuous values within $[0, 1]$.
In this new formulation, the constraint $p_j(i) [1 - p_j(i)] = 0$
is replaced by $0 \le p_j(i) \le 1$. We can now resort to the
KKT conditions. Equivalence of this formulation is proved
subsequently, where we show that the optimal points are on
the border of the region $[0, 1]$, which coincides with the fact
that $p_j(i)$ takes binary values. The optimization problem
is formulated as follows:

$$\nabla f(p^*) - \lambda \nabla h_1(p^*) - \mu \nabla h_2(p^*) - \sum_{i=1}^{N} \alpha_i \nabla h_{2+i}(p^*) + \sum_{j=0}^{2} \sum_{i=1}^{N} \beta_i^{(j)} \nabla [1 - p_j^*(i)] + \sum_{j=0}^{2} \sum_{i=1}^{N} \psi_i^{(j)} \nabla p_j^*(i) = 0 \tag{9a}$$
$$0 \le p_j^*(i) \le 1, \quad j \in \{0,1,2\} \tag{9b}$$
$$\beta_i^{(j)} [1 - p_j^*(i)] = 0, \quad \beta_i^{(j)} \ge 0 \tag{9c}$$
$$\psi_i^{(j)} p_j^*(i) = 0, \quad \psi_i^{(j)} \ge 0 \tag{9d}$$
$$h_{2+i}(p^*) = 0, \quad i = 1, \ldots, N, \quad N \to \infty \tag{9e}$$
$$h_k(p^*) = 0, \quad k = 1, 2. \tag{9f}$$

The KKT necessary conditions state that if $p^*$ is a local
optimum, there exist constant coefficients $\lambda$, $\mu$, $\alpha_i$,
$\beta_i^{(j)}$ and $\psi_i^{(j)}$ such that (9) is satisfied. Furthermore,
because the target function and constraint functions in (7) are linear,
the KKT necessary conditions are also sufficient conditions, and a local
maximal point is the global maximal point as well. This means that
if there exist constant coefficients $\lambda$, $\mu$, $\alpha_i$,
$\beta_i^{(j)}$ and $\psi_i^{(j)}$ satisfying (9) for a point $p^*$,
then $f(p^*)$ must be the global maximum.

C. The decision function for adaptive link selection 
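Proposition 1: The global optimal decision function is

$$\begin{aligned}
p_0^*(i) &= 1, \text{ when } -\frac{\lambda+1}{\lambda+\mu+1} \le \frac{C_0(i)}{C_2(i)} \\
p_1^*(i) &= 1, \text{ when } -\frac{\lambda+\mu+1}{\mu+1} < \frac{C_0(i)}{C_2(i)} < -\frac{\lambda+1}{\lambda+\mu+1} \\
p_2^*(i) &= 1, \text{ when } \frac{C_0(i)}{C_2(i)} \le -\frac{\lambda+\mu+1}{\mu+1}.
\end{aligned} \tag{10}$$

The proof is based on the KKT conditions (9a)-(9e) and is provided
in Appendix A. Next, we use the KKT condition (9f) to
get the thresholds $\lambda$, $\mu$ in the decision function.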
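
A minimal Python sketch of this decision rule is given below. It evaluates the cross-multiplied inequalities (15b) and (16b) from Appendix A rather than the rate ratio, which avoids a division by a rate that may be zero. The function name `select_node` is hypothetical, and the values $\lambda = \mu = -0.6$ used in the usage note are illustrative thresholds, not solutions reported in the paper.

```python
def select_node(c0, c2, lam, mu):
    """Return the node (0, 1 or 2) scheduled in a slot with rates c0, c2.

    Tests, in order:
      node 0 if (lam + mu + 1) * c0 + (lam + 1) * c2 <= 0   (15b)
      node 2 if (mu + 1) * c0 + (lam + mu + 1) * c2 <= 0    (16b)
      node 1 otherwise                                       (17)
    Assumes -1 < lam, mu < 0 and lam + mu + 1 < 0 (Appendix C).
    """
    s = lam + mu + 1.0
    if s * c0 + (lam + 1.0) * c2 <= 0.0:
        return 0  # link U0-RS1 is strong relative to link RS1-U2
    if (mu + 1.0) * c0 + s * c2 <= 0.0:
        return 2  # link RS1-U2 is strong relative to link U0-RS1
    return 1      # comparable rates: RS1 broadcasts
```

With $\lambda = \mu = -0.6$, U0 is selected when $C_0(i)/C_2(i) \ge 2$, U2 when $C_0(i)/C_2(i) \le 0.5$, and RS1 broadcasts in between.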

The KKT condition (9f) is equivalent to (6). The probability
density function of $\gamma_l(i)$ is
$f(\gamma_l) = \frac{1}{\Omega_l} e^{-\gamma_l/\Omega_l}$, $\gamma_l \ge 0$, $l \in \{0, 2\}$.
From (10), we have the integral expressions (11), where
$L_1 = (\gamma_2 + 1)^{-\frac{\lambda+1}{\lambda+\mu+1}} - 1$,
$L_2 = (\gamma_0 + 1)^{-\frac{\mu+1}{\lambda+\mu+1}} - 1$,
$L_3 = (\gamma_0 + 1)^{-\frac{\lambda+\mu+1}{\lambda+1}} - 1$ and
$L_4 = (\gamma_2 + 1)^{-\frac{\lambda+\mu+1}{\mu+1}} - 1$.
Substituting (11) into (6), and simplifying the integral equations
using (12), we can find a numerical solution for $\lambda$ and $\mu$,
which are used in Section V to determine the sum-rate.

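$$\int_a^b \ln(1 + x) \frac{1}{\Omega} e^{-x/\Omega} dx = e^{-a/\Omega} \ln(1 + a) - e^{-b/\Omega} \ln(1 + b) + e^{1/\Omega} \left[ E_1\left(\frac{1+a}{\Omega}\right) - E_1\left(\frac{1+b}{\Omega}\right) \right] \tag{12}$$

where $E_1(x) = \int_x^{\infty} \frac{e^{-t}}{t} dt$ is the exponential integral.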
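
As a cross-check that does not require evaluating the integrals, the two stability conditions can also be estimated by sampling. The Python sketch below returns Monte Carlo estimates of $E\{p_0 C_0\} - E\{p_1 C_2\}$ and $E\{p_2 C_2\} - E\{p_1 C_0\}$ for candidate thresholds; both should be close to zero at the solution. It assumes Rayleigh fading and the cross-multiplied decision tests (15b) and (16b); the function name `stability_residuals`, the sample size and the seed are hypothetical, and this sampling approach is a substitute for, not a reproduction of, the closed-form route described above.

```python
import math
import random

def stability_residuals(lam, mu, omega0, omega2, n_slots=20000, seed=0):
    """Estimate (E{p0 C0} - E{p1 C2}, E{p2 C2} - E{p1 C0}) by sampling.

    lam, mu:        candidate thresholds (assumed to lie in (-1, 0)).
    omega0, omega2: mean SNRs (linear scale) of the two links.
    """
    rng = random.Random(seed)
    s = lam + mu + 1.0
    r_a = 0.0
    r_b = 0.0
    for _ in range(n_slots):
        c0 = math.log2(1.0 + rng.expovariate(1.0 / omega0))
        c2 = math.log2(1.0 + rng.expovariate(1.0 / omega2))
        if s * c0 + (lam + 1.0) * c2 <= 0.0:    # U0 transmits
            r_a += c0
        elif (mu + 1.0) * c0 + s * c2 <= 0.0:   # U2 transmits
            r_b += c2
        else:                                    # RS1 broadcasts
            r_a -= c2
            r_b -= c0
    return r_a / n_slots, r_b / n_slots
```

A two-dimensional root finder over $(\lambda, \mu)$ could then drive both residuals towards zero; the accuracy is limited by the sampling noise.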



IV. Reference system 



The transmission schedule of the reference system is fixed
and determined in advance. The transmissions from U0, U2
and RS1 are done sequentially and have durations $t_0$, $t_2$ and $t_1$,
respectively. The transmissions span many channel fades,
so each link is assumed to achieve its ergodic capacity.
The ergodic capacity of link $l$ is

$$\bar{C}_l = \int_0^{\infty} \log_2(1 + \gamma_l) f(\gamma_l) d\gamma_l.$$

Furthermore, RS1 is assumed to be equipped with infinitely
large buffers.

The broadcast period is $t_1 = \max\{\bar{C}_0 t_0/\bar{C}_2, \bar{C}_2 t_2/\bar{C}_0\}$. The
sum-rate of this system is $R_{ref} = (\bar{C}_0 t_0 + \bar{C}_2 t_2)/(t_0 + t_1 + t_2)$. Optimizing
w.r.t. $t_0, t_1, t_2$ yields the maximal sum-rate of the reference system.

Figure 2. Sum-rate under different SNR conditions

Figure 3. Performance of finite buffer size
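
The reference system can be evaluated numerically with the Python sketch below. It rests on two assumptions read from the description of the reference system: the broadcast period must be long enough to drain both buffers, $t_1 = \max\{\bar{C}_0 t_0/\bar{C}_2, \bar{C}_2 t_2/\bar{C}_0\}$, and the sum-rate is $(\bar{C}_0 t_0 + \bar{C}_2 t_2)/(t_0 + t_1 + t_2)$. The closed form in `balanced_sum_rate` follows from choosing $t_0$ and $t_2$ so that both terms of the maximum are equal; it is derived here for illustration and is not quoted from the paper. All function names are hypothetical.

```python
import math

def ergodic_capacity(omega, n=100000):
    """Ergodic capacity (bits) of a Rayleigh link with mean SNR omega.

    Uses the substitution gamma = -omega * ln(u), u in (0, 1), and a
    midpoint rule, so that only the standard library is needed.
    """
    total = 0.0
    for k in range(n):
        u = (k + 0.5) / n
        total += math.log2(1.0 - omega * math.log(u))
    return total / n

def schedule_sum_rate(t0, t2, c0, c2):
    """Sum-rate of a fixed schedule with uplink durations t0 and t2."""
    t1 = max(c0 * t0 / c2, c2 * t2 / c0)  # time needed to drain both buffers
    return (c0 * t0 + c2 * t2) / (t0 + t1 + t2)

def balanced_sum_rate(c0, c2):
    """Sum-rate when t0 = t1 * c2 / c0 and t2 = t1 * c0 / c2."""
    return c0 * c2 * (c0 + c2) / (c0 * c0 + c2 * c2 + c0 * c2)
```

For two links with equal ergodic capacity $\bar{C}$, the balanced schedule gives $2\bar{C}/3$, since the three phases then have equal duration.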



V. Numerical results 

Setting $\Omega_0 = 10$ dB or $\Omega_0 = 20$ dB and $\Omega_2/\Omega_0$ from 0.1 to
10, we solve the integral equations (11) to get the thresholds
$\lambda$, $\mu$ and the theoretical results of the sum-rate shown in
Fig. 2. Applying the thresholds $\lambda$, $\mu$ obtained from the integral
equations and the decision function (10) in the simulation,
we get the simulation results of the sum-rate shown in Fig. 2.
We observe that the theoretical results and the simulation
results coincide. Whatever the SNRs $\gamma_0$ and $\gamma_2$, the sum-rate
of the proposed adaptive link selection protocol is larger than
the sum-rate of the reference system achieving the ergodic
capacity for each single link.

We now show the effect of a limitation in the buffer size.
We find the optimal $\lambda$, $\mu$ from the integral equations and apply the
decision function in (10), but with the following additional
constraint: if $C_l(i) + Q_l(i-1) > \max Q_l$, link $l$ is not chosen.
The sum-rate for finite buffer size is denoted as $R_{sum}^{f}$. Fig. 3
shows the simulation results as a function of the buffer size
when the buffer sizes of $Q_0$ and $Q_2$ are equal.
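
The finite-buffer experiment can be sketched in Python as follows. The sketch assumes Rayleigh fading, the cross-multiplied decision tests (15b) and (16b), and one behaviour that the text leaves open: when the selected user link would overflow its buffer, the slot is simply left idle. The function name `simulate`, the parameter `q_max` and the default values are hypothetical.

```python
import math
import random

def simulate(lam, mu, omega0, omega2, q_max=float("inf"),
             n_slots=20000, seed=0):
    """Return (sum_rate, peak_q0, peak_q2) for the adaptive protocol."""
    rng = random.Random(seed)
    s = lam + mu + 1.0
    q0 = q2 = 0.0
    peak0 = peak2 = 0.0
    delivered = 0.0
    for _ in range(n_slots):
        c0 = math.log2(1.0 + rng.expovariate(1.0 / omega0))
        c2 = math.log2(1.0 + rng.expovariate(1.0 / omega2))
        if s * c0 + (lam + 1.0) * c2 <= 0.0:
            node = 0
        elif (mu + 1.0) * c0 + s * c2 <= 0.0:
            node = 2
        else:
            node = 1
        if node == 0 and q0 + c0 <= q_max:
            q0 += c0
        elif node == 2 and q2 + c2 <= q_max:
            q2 += c2
        elif node == 1:
            out0 = min(c2, q0)  # bits of U0 delivered to U2
            out2 = min(c0, q2)  # bits of U2 delivered to U0
            q0 -= out0
            q2 -= out2
            delivered += out0 + out2
        # Otherwise the chosen buffer would overflow: idle slot (assumption).
        peak0 = max(peak0, q0)
        peak2 = max(peak2, q2)
    return delivered / n_slots, peak0, peak2
```

Calling `simulate` for increasing values of `q_max` with fixed thresholds gives a sum-rate curve against the buffer size, in the spirit of the experiment described above.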



VI. Conclusion 

We exploit the presence of a buffer at the relay of a 
TDBC two-way relay network to opportunistically schedule 
the communication links maximizing the system sum-rate and 
guaranteeing buffer stability conditions. We use the KKT 
approach to derive the decision function for adaptive link 
selection under a Rayleigh fading assumption. The numerical
results show that the proposed protocol outperforms the
reference protocol, where scheduling is determined in advance
and does not depend on the channel state information. An
interesting issue for future work is to investigate two-way
relaying with a common buffer of limited size, as well as
the impact of the direct link between the users, along with a
compress-and-forward relaying strategy.

Appendix A 



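The gradient equation (9a) is equivalent to the following
equations for $\forall i$ (up to the common positive factor $1/N$,
which is absorbed into the multipliers):

$$-\lambda C_0(i) + \alpha_i - \beta_i^{(0)} + \psi_i^{(0)} = 0 \tag{13a}$$
$$-\mu C_2(i) + \alpha_i - \beta_i^{(2)} + \psi_i^{(2)} = 0 \tag{13b}$$
$$(\mu + 1) C_0(i) + (\lambda + 1) C_2(i) + \alpha_i - \beta_i^{(1)} + \psi_i^{(1)} = 0. \tag{13c}$$

In Appendix B, we prove that the solutions $p_j^*(i)$ of (13) can
only be 0 or 1.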

We first examine the case $p_0^*(i) = 1$, $p_1^*(i) = 0$, $p_2^*(i) = 0$.
Considering (9c) and (9d), we have $\beta_i^{(1)} = \beta_i^{(2)} = 0$ and
$\psi_i^{(0)} = 0$, respectively. Then, subtracting (13a) from (13b) and
(13c), we get

$$\lambda C_0(i) - \mu C_2(i) + \psi_i^{(2)} + \beta_i^{(0)} = 0 \tag{14a}$$
$$(\lambda + \mu + 1) C_0(i) + (\lambda + 1) C_2(i) + \psi_i^{(1)} + \beta_i^{(0)} = 0. \tag{14b}$$

From (9c) and (9d), $\psi_i^{(1)}, \psi_i^{(2)}, \beta_i^{(0)} \ge 0$ must be satisfied.
Hence the necessary conditions for $p_0^*(i) = 1$ are as follows:

$$\lambda C_0(i) - \mu C_2(i) \le 0 \tag{15a}$$
$$(\lambda + \mu + 1) C_0(i) + (\lambda + 1) C_2(i) \le 0. \tag{15b}$$

Obviously, if inequalities (15) are satisfied, we can always
find appropriate coefficients $\psi_i^{(1)}, \psi_i^{(2)}, \beta_i^{(0)} \ge 0$ verifying
(14). As the only constraints on $\alpha_i$ are contained in (13), if (14)
is satisfied, we can find an appropriate $\alpha_i$ to meet (13)
as well. So (15) is also a sufficient condition for $p_0^*(i) = 1$.

Similarly, for $p_2^*(i) = 1$ and $p_1^*(i) = 1$, the equivalent
conditions are (16) and (17), respectively.

$$\lambda C_0(i) - \mu C_2(i) \ge 0 \tag{16a}$$
$$(\mu + 1) C_0(i) + (\lambda + \mu + 1) C_2(i) \le 0. \tag{16b}$$

$$(\lambda + \mu + 1) C_0(i) + (\lambda + 1) C_2(i) \ge 0 \tag{17a}$$
$$(\mu + 1) C_0(i) + (\lambda + \mu + 1) C_2(i) \ge 0. \tag{17b}$$



Next, we simplify conditions (15)-(16) by removing the
redundant inequalities. From (15), (16) and (17), we can infer
$-1 < \lambda, \mu < 0$ and $\lambda + \mu + 1 < 0$. The proof is deferred
to Appendix C. By canceling $C_0(i)$ and $C_2(i)$ in (17), i.e. by requiring
that the interval of $C_0(i)/C_2(i)$ delimited by (17a) and (17b) is
non-empty, we obtain (18).

$\lambda$, $\mu$ are constant parameters (they do not depend on the time index
$i$), so (18) should also be valid in conditions (15) and (16).
Hence, when (15b) is verified, so is (15a). Similarly, when
(16b) is verified, so is (16a).

Hence, we have proven that if (10) is valid, we can find
appropriate coefficients $\psi_i^{(j)}, \beta_i^{(j)} \ge 0$ and $\alpha_i$ to meet the
KKT conditions (9a)-(9e).



Appendix B 

If there exists one non-binary number among $p_0^*(i)$, $p_1^*(i)$
and $p_2^*(i)$, there are at least two non-binary numbers among
them, otherwise $p_0^*(i) + p_1^*(i) + p_2^*(i) = 1$
cannot be satisfied. We assume, for instance, that $p_0^*(i)$ and $p_2^*(i)$
are non-binary; the proof for the other cases is similar. Then $p_1^*(i)$
cannot be 1. From (9c) and (9d), we obtain $\beta_i^{(0)} = \beta_i^{(1)} =
\beta_i^{(2)} = 0$ and $\psi_i^{(0)} = \psi_i^{(2)} = 0$, respectively. Substituting the
above results into (13a) and (13b), which are valid for $\forall i$, we
get $\lambda = \mu = \alpha_i = 0$. Then, considering (13c), we obtain
$C_0(i) + C_2(i) + \psi_i^{(1)} = 0$. However, $C_0(i), C_2(i) > 0$ and
$\psi_i^{(1)} \ge 0$ should be satisfied, which is a contradiction. So
$p_0^*(i)$, $p_1^*(i)$ and $p_2^*(i)$ are on the border of $[0, 1]$.

Appendix C 

Note that $C_0(i), C_2(i) > 0$ is always satisfied. $\lambda$ and $\mu$
must have the same sign, otherwise (15a) or (16a) will be
valid constantly, which means U0 or U2 will transmit in every
time slot. $\lambda, \mu < 0$ must be satisfied, otherwise (15b) and (16b)
will never happen, which means RS1 will broadcast in every
time slot. And $-1 < \lambda, \mu$ also should be satisfied, otherwise
(17) will never happen, which means there is no broadcast at
all. So we get $-1 < \lambda, \mu < 0$. Similarly, $\lambda + \mu + 1 < 0$ is
valid, otherwise RS1 will broadcast in every time slot.

$$\lambda^2 + \mu^2 + \lambda\mu + \lambda + \mu < 0. \tag{18}$$